Acquisition project | ElevenLabs

ElevenLabs is an Audio AI platform which helps creators expand their reach globally through realistic AI speech synthesis, professional voiceovers, perfect voice cloning and multilingual functionality.

Product

ElevenLabs is an audio AI platform for creating realistic, human-sounding speech rich in emotion and intonation. It offers a library of 3000+ voices and multilingual capability, which can make almost any piece of audio content universally accessible. ElevenLabs has gained recognition for its high-quality, lifelike voice synthesis, which lets users convert written text into natural-sounding spoken audio.


The product suite consists of:

  • Speech synthesis - The flagship product. It lets users generate lifelike speech from text, synthesizing vocal emotion and intonation, using a library of 3000+ voices or a custom voice that users create themselves (more details in the appendix). The Text to Speech (TTS) technology creates natural, engaging voices in multiple languages. Publishers use it to create audiobooks, content companies to produce podcasts, game developers to voice in-game characters, and content creators and bloggers to add a "listen to this article" feature.
  • AI Dubbing Studio - Localizes content across 29 languages with AI dubbing. It translates audio and video while preserving the emotion, timing, tone, and unique characteristics of each speaker, and provides an end-to-end tool for managing and editing any step of the AI dubbing workflow. Ideal for content creators who want to reach a global audience without losing the essence of the original audio.
  • Projects - End-to-end workflow for creating audiobooks and other long-form audio from long-form text.
  • Voiceover studio - Users can upload a video and add AI-generated voiceover and sound effects. They can write a dialogue between any number of speakers, choose the speakers, and insert sound effects anywhere they like.
  • Conversational AI - Targeted at enterprises that want to create a voice agent with specific characteristics, connected to a specified knowledge base. For example, a user could create a customer support agent, a sales agent, or a math tutor with customized knowledge.
  • ElevenLabs reader - While all other products are targeted at creators / B2B, this product is purely B2C: users can paste any link and listen to an audio narration of the content. It is completely free and most likely a top-of-funnel (TOF) exercise for ElevenLabs.

In addition, ElevenLabs provides APIs for their Text to Speech technology which enterprises and developers can integrate into their existing systems for seamless automation of voice tasks.


Before ElevenLabs, content creators had no ready-made tools to expand into other languages while keeping their own voice, emotion and intonation. Large creator-led companies had to use professional dubbing services with human voice artists, which was both expensive (~$100 / min including voice actors' fees, post-production and studio) and time-consuming (>2 weeks for a 10-min video, involving multiple functions; longer videos take months).
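
As a rough illustration of the cost figure above, the sketch below multiplies the ~$100 / min estimate by video length and number of target languages. The function name and the assumption that cost scales linearly with minutes and languages are illustrative only; they are not taken from ElevenLabs or from any dubbing vendor.

```python
# Rough cost of traditional (human) dubbing, using the ~$100 / min figure quoted above.
# Assumption (not from the source): cost scales linearly with video length
# and with the number of target languages.
HUMAN_DUBBING_USD_PER_MIN = 100  # estimate quoted in the text


def human_dubbing_cost(video_minutes: float, target_languages: int = 1) -> float:
    """Return the estimated cost in USD of dubbing one video into N languages."""
    if video_minutes < 0 or target_languages < 0:
        raise ValueError("minutes and languages must be non-negative")
    return HUMAN_DUBBING_USD_PER_MIN * video_minutes * target_languages


if __name__ == "__main__":
    print(human_dubbing_cost(10))      # a 10-minute video, one language
    print(human_dubbing_cost(10, 29))  # the same video in all 29 dubbing languages
```

Under these assumptions a 10-minute video costs about $1,000 per target language, which helps explain why multilingual dubbing was out of reach for most individual creators.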


Users

ElevenLabs targets 5 distinct personas:

  • Publishers:
    • Description: This group includes traditional publishing houses like HarperCollins, independent publishers like Lukeman Literary, audiobook platforms like Storytel, and platforms offering bite-sized learning content like AnyTopic.
    • Needs: They need cost-effective and scalable solutions for creating high-quality audiobooks in various languages. They are looking for tools that can speed up production time and offer flexibility in terms of voice selection and customization.
    • ElevenLabs Solution: ElevenLabs provides a suite of tools, including the "Projects" product and AI-powered narration capabilities, to meet these needs.
  • Content Creators:
    • Description: This group encompasses YouTubers and online educators like Never Too Small, Lutz Finger, and Leeanna Morgan, podcast producers like Audio Pitara, and video platforms like Kapwing and Aug X Labs.
    • Needs: They require high-quality AI voices for video narration, dubbing, podcasts, and educational content. They value realistic and emotionally expressive voices that enhance engagement and offer multilingual capabilities for wider reach.
    • ElevenLabs Solution: ElevenLabs provides a diverse library of AI voices, tools for voice cloning, and its text-to-speech technology in multiple languages.
  • Game Developers:
    • Description: This segment includes AAA game studios like Paradox Interactive, and indie developers like Magicave.
    • Needs: They need AI voices for character dialogue, narration, and dynamic audio content generation. They look for tools that can speed up the game development process, offer flexibility in voice creation, and allow for unique and immersive gameplay experiences.
    • ElevenLabs Solution: ElevenLabs provides AI voice technology for character voice creation and in-game narration, helping streamline game development processes.
  • Businesses:
    • Description: This group includes companies using AI for sales and customer support automation, like GAIL, Call Simulator, Infer.so, Thoughtly, and Synthflow.
    • Needs: They need natural-sounding and engaging AI voices for conversational AI applications like chatbots, virtual assistants, and automated call handling. They require tools for seamless integration with their existing systems and multilingual support for international operations.
    • ElevenLabs Solution: ElevenLabs provides high-quality AI voices, low latency voice models for real-time conversations, and easy-to-use APIs for integration with various platforms.
  • AI Character Companies:
    • Description: This category comprises companies developing AI companions, like Shapes and Kindroid, and apps offering personalized experiences like SoundAiSleep.
    • Needs: They require lifelike and customizable AI voices to bring their virtual characters and personalized audio experiences to life. They seek solutions that offer high levels of voice quality, flexibility in customization, and seamless integration with their platforms.
    • ElevenLabs Solution: ElevenLabs provides its diverse voice library, voice cloning technology, and flexible pricing models that cater to startups and growing businesses.


Based on these personas, the ideal customer profile (ICP) of ElevenLabs shares some common features:

  • Companies and individuals looking to enhance their content or services with high-quality, human-like AI voices.
  • Value speed, scalability, and efficiency in audio production.
  • Require multilingual capabilities for wider reach and global operations.
  • Seek innovative solutions for creating engaging and immersive experiences for their target audience.


Important note - all personas except creators are B2B


B2B personas

| Attribute / Persona | Publishers | Game Developers | Businesses | AI Character Companies |
| --- | --- | --- | --- | --- |
| Name | HarperCollins, Lukeman Literary, Storytel, AnyTopic | Paradox Interactive, Magicave | GAIL, Call Simulator, Infer.so, Thoughtly, Synthflow | Shapes, Kindroid, SoundAiSleep |
| Company Size | Large to Medium | Medium to Large | Small to Medium | Small to Medium |
| Location | Global (US, UK, Europe) | Global (e.g., Sweden for Paradox Interactive) | Various (Global) | Various |
| Funding Raised | Established companies, some publicly traded; independents may have limited funding | Paradox Interactive: Publicly traded; Magicave: Early-stage funding | Early-stage startups with seed to Series A funding | Startups with seed to Series A funding |
| Industry Domain | Publishing, Audiobooks, E-learning | Video Game Development | AI for Sales, Customer Support Automation | AI Companions, Personalized Audio Experiences |
| Stage of the Company | Established to Growth Stage | Established (Paradox), Early-stage (Magicave) | Early-stage to Growth Stage | Early-stage |
| Organization Structure | Hierarchical with departments (Editorial, Production, Marketing) | Divided into studios, departments (Development, Audio, Design) | Flat to hierarchical structures depending on size; departments for tech, product, sales | Flat structure typical in startups |
| Need | Publishers need scalable and cost-effective solutions for producing high-quality audiobooks in multiple languages. ElevenLabs provides tools like AI-powered narration and the "Projects" feature to meet these needs. | Game Developers require AI voices for character dialogue and narration to enhance gameplay and speed up development. ElevenLabs offers AI voice technology that streamlines these processes. | Businesses focused on sales and customer support automation need natural-sounding AI voices for conversational applications. ElevenLabs provides high-quality AI voices and low-latency models suitable for real-time interactions. | AI Character Companies developing AI companions and personalized experiences need lifelike and customizable voices. ElevenLabs' voice library and cloning technology cater to these requirements. |
| Decision Maker | Head of Audio Production, CTO, CEO, Senior Executives | Game Directors, CTO, Head of Audio | CTO, Head of Product, CEO | CTO, Product Managers, CEO |
| Decision Blocker | Legal Department, Budget Committee, Quality Assurance | Budget Constraints, Technical Feasibility, Legal Compliance | Legal, Budget Constraints, Security Concerns | Budget Limitations, Technical Challenges |
| Frequency of Use Case | High; continuous audiobook production, content creation | Regular; aligned with game development cycles | High; ongoing customer interactions requiring AI voice | High; integral to product offering, daily use |
| Tools Utilized in Workspace | Audio Editing Software (Pro Tools, Audacity); Content Management Systems; Project Management Tools (Asana, Jira) | Game Engines (Unity, Unreal); Audio Tools (FMOD, Wwise); Design Software | CRM Systems (Salesforce); AI Platforms; Communication Tools (Slack, Teams) | AI Development Tools; Voice Synthesis APIs; Design Software |
| Organizational Goals | Increase efficiency in audiobook production; Expand multilingual offerings; Reduce costs | Enhance game experiences with AI voices; Speed up development; Reduce reliance on traditional voice actors | Automate customer interactions; Improve efficiency; Enhance user experience | Deliver lifelike AI companions; Increase user engagement; Innovate in AI experiences |
| Preferred Outreach Channels | Industry Conferences, Direct Sales Outreach | Industry Events, Networking | Tech Conferences, Online Marketing, Direct Sales | Startup Events, Tech Blogs, Online Marketing |
| Conversion Time | Medium to Long (3-6 months) | Medium (1-3 months), aligned with project timelines | Short to Medium (1-3 months), depending on company size and urgency | Short (less than 1 month), startups can make quick decisions |
| GMV | High (Significant revenue from book sales and audiobooks) | High (Revenue from successful game titles) | Varied; potential for high revenue depending on market adoption | Growing; potential for high revenue with successful user adoption |
| Growth of Company | Stable with a focus on digital innovation | Steady growth; emphasis on innovation | Rapid growth; scaling operations | Rapid growth; innovating in emerging market |
| Motivation | Reduce costs and production time; Stay competitive in digital market; Reach global audiences | Speed up development; Enhance player experiences; Expand global reach | Automate processes; Improve customer satisfaction; Reduce operational costs | Create unique user experiences; Lead in AI innovation; Increase user retention |
| Organization Influence | High influence in global publishing industry | Influential in gaming industry | Emerging influence in AI customer service market | Emerging influence in AI companion market |
| Decision Time | Medium to Long; decisions may require multiple approvals | Medium; decisions may be project-driven | Short to Medium; startups may decide quickly | Short; startups can make swift decisions |

B2C Personas

| Attribute | Content Creators |
| --- | --- |
| Name | Never Too Small, Lutz Finger, Leeanna Morgan, Audio Pitara, Kapwing, Aug X Labs |
| Age | Typically between 25-45 |
| Demographics | Gender: Male and Female; Location: Global (US, Europe, Asia); Occupation: YouTubers, Podcasters, Online Educators; Education Level: Bachelor's Degree or higher; Income Level: Moderate to High (depending on content success) |
| Need | High-quality AI voices for video narration, dubbing, podcasts, educational content; Multilingual capabilities for wider reach |
| Pain Point | Limited resources for professional voiceovers; Time-consuming and costly production processes; Need to engage audiences effectively |
| Solution | ElevenLabs' AI voice library, voice cloning, and text-to-speech technology providing realistic and expressive voices in multiple languages |
| Behaviour | Early adopters of technology; Active on social media and content platforms; Focused on audience engagement and growth |
| Perceived Value of Brand | Innovative and cutting-edge; Offers high-quality, realistic AI voices; Cost-effective and time-saving solution |
| Marketing Pitch | "Enhance your content with lifelike AI voices from ElevenLabs, reaching global audiences effortlessly." |
| Goals | Increase content quality and engagement; Expand audience reach globally; Produce content efficiently and cost-effectively |
| Frequency of Use Case | Frequent; utilized with each new content piece (videos, podcasts, courses) |
| Average Spend on Product | Moderate; willing to invest in tools that improve content quality and production efficiency |
| Value Accessibility to Product | Requires user-friendly interfaces; Affordable pricing plans; Accessible customer support |
| Value Experience of Product | High-quality and natural-sounding voice output; Reliable performance; Enhances overall content appeal |


Prioritisation among the different customer profiles

  • Publishers:
    • Adoption Curve: Slow. Traditional publishers often have lengthy decision-making processes due to organizational hierarchy, legal considerations, and risk aversion. Adoption of new technologies can be cautious and time-consuming.
    • Frequency of Use: High. Publishers consistently produce audiobooks and other audio content, requiring regular use of voice synthesis technology.
    • Appetite to Pay: High. Publishers have significant budgets allocated for production. If they perceive clear value and ROI, they are willing to invest heavily.
    • Total Addressable Market (TAM): Moderate. While the global publishing industry is substantial, the segment interested in AI voice technology is smaller, focusing on innovative publishers willing to adopt new methods.
    • Distribution Potential: Challenging. Reaching publishers typically requires direct sales efforts, industry networking, and can involve long sales cycles. Establishing trust and demonstrating value is essential.
    • Conclusion: Despite the high frequency of use and strong ability to pay, the slow adoption curve and challenging distribution make publishers a less optimal priority for immediate focus.
  • Content Creators:
    • Adoption Curve: High. Content creators are often early adopters, eager to utilize new technologies to enhance their content and gain a competitive edge.
    • Frequency of Use: High. They produce content regularly, sometimes daily, requiring frequent use of voice synthesis tools.
    • Appetite to Pay: Variable (Low to Medium). Independent creators may have limited budgets, while successful creators or organizations may be willing to spend more.
    • Total Addressable Market (TAM): Large. Millions of content creators exist globally across platforms like YouTube, podcasts, and e-learning.
    • Distribution Potential: High. They can be reached effectively through online marketing, social media, influencer partnerships, and scalable self-service platforms.
    • Conclusion: Content creators present a substantial opportunity with lower barriers to adoption and distribution. The large market size compensates for the variable appetite to pay, making them the top priority for focus.
  • Game Developers:
    • Adoption Curve: Variable. Indie developers may quickly adopt new technologies, while larger studios have longer evaluation processes due to project planning and risk assessment.
    • Frequency of Use: Medium to Low. Use is often project-based, corresponding with game development cycles, which can span months or years.
    • Appetite to Pay: Variable. Large studios have substantial budgets, but indie developers may have limited financial resources.
    • Total Addressable Market (TAM): Moderate. The number of game development studios is smaller compared to content creators, but the industry has significant revenue.
    • Distribution Potential: Moderate. Requires targeted outreach at industry events, conferences, and through professional networks.
    • Conclusion: While there is potential for high-value contracts with large studios, the variable adoption rate, lower frequency of use, and moderate distribution challenges make game developers a secondary priority.
  • Businesses:
    • Adoption Curve: Variable. Some businesses are innovative and adopt AI solutions early, but many are cautious due to concerns over customer experience, data security, and integration complexities.
    • Frequency of Use: High. Customer interactions occur daily, necessitating continuous use of voice technology.
    • Appetite to Pay: High. Businesses are willing to invest in solutions that can reduce costs and improve customer satisfaction, especially if they can see clear ROI.
    • Total Addressable Market (TAM): Large. There are numerous businesses across industries that could benefit from AI voice solutions.
    • Distribution Potential: Challenging. Selling to businesses often involves navigating complex procurement processes, reaching decision-makers, and dealing with longer sales cycles.
    • Conclusion: Businesses offer high revenue potential due to their appetite to pay and frequent use. However, the variable adoption curve and challenging distribution make them a secondary focus, requiring dedicated resources for targeted sales efforts.
  • AI Character Companies:
    • Adoption Curve: High. These companies are at the forefront of AI technology and are eager to adopt advanced solutions to enhance their offerings.
    • Frequency of Use: High. AI voices are integral to their products, requiring constant use.
    • Appetite to Pay: Variable (Low to Medium). Many are startups with limited budgets but may prioritize spending on core technology.
    • Total Addressable Market (TAM): Small. The niche market of AI character companies is relatively limited in size.
    • Distribution Potential: Moderate to High. Can be effectively reached through industry networks, startup communities, and targeted marketing.
    • Conclusion: While they are enthusiastic adopters and frequent users, the small market size and variable ability to pay limit the growth opportunities. They could be considered for future focus as the market expands.

Prioritisation conclusion summary

  • Primary Focus: Content Creators
    • Rationale: Content creators have a high adoption curve, frequent usage, a large market size, and high distribution potential despite variable appetite to pay.
    • Strategy: Leverage online channels, offer scalable solutions with flexible pricing, and focus on ease of use to attract this segment.
  • Secondary Focus: Businesses
    • Rationale: Businesses have a high appetite to pay and frequent usage but present challenges in adoption and distribution.
    • Strategy: Allocate dedicated resources for targeted outreach, emphasize ROI and efficiency gains, and address concerns about integration and security.


Market

Creator segment

The creator segment covers natural-sounding voice generation and voice cloning, conversion into multiple languages, and expressive character voices.

| Players | Monthly Cost | Overview | Pros | Best For |
| --- | --- | --- | --- | --- |
| ElevenLabs | From just $1 / Month | High-quality AI voice generation with multilingual capability. | Budget-friendly, diverse voice range, and multilingual | Professional level voice generation |
| Lovo AI | Basic starts at $24 / Month | Extensive voice options in 100+ languages, suited for script and video. | Variety of voices, clean UI | Variety of voices and settings |
| Play.ht | Paid options start at $31.20 / Month | Offers 900+ voices, ideal for narration, podcasts, and eBooks. | Emotionally adjustable voices, multilingual audio | Multilingual side-by-side audio |
| NaturalReaders | $99.50 for lifetime use | Simple TTS with platform-friendly cross-collaboration options. | Instant realistic voices, works on multiple platforms | Its platform-friendly interface |
| Narakeet | $0.20 per minute (confusing pricing) | Video-focused TTS with templates for video creation. | Video narration, easy to use | Built-in AI video creation tools |
| Fakeyou | Starts at $7 / Month (no free trial) | Character voices, including celebrities and fictional characters. | Unlimited TTS on every plan | Creating famous voices |
| Uberduck | Creator packages start at $96 / Year | Mimics celebrity voices, ideal for creative audio content. | API access for premium users, customizable pitch/speed | Customizable pitch and speed settings |
| Murf AI | $29 / Month | TTS with voice cloning and noise removal, ideal for professional use. | Background noise removal, volume/pitch adjustment | Removing background music |

Business segment

The business segment covers companies that need advanced text-to-speech functionality with enterprise-grade security.

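| Provider | Total Number of Voices | Number of Languages | API Availability | Voice Cloning | AI Dubbing | Free Trial |
| --- | --- | --- | --- | --- | --- | --- |
| ElevenLabs | 1200+ | 29 | | | | |
| PlayHT | 600+ | 140+ | | | | |
| Microsoft | 400+ | 140+ | | | | |
| Google | 220+ | 40+ | | | | |
| Amazon Polly | 60 | 29 | | | | |
| Speechify | 130 | 30 | | | | |
| Open AI | 6 | 57 | | | | |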


| Feature | ElevenLabs | Alternatives |
| --- | --- | --- |
| Language Support & Customization | 1200+ voices in 29 languages, with customizable pitch and intonation. Offers VoiceLab for cloning and dubbing tools. | PlayHT, Microsoft, and Google TTS support many voices, but lack the same customization and emotional depth. |
| User Experience & Integration | Simple web-based text entry, with Projects and VoiceLab for bulk TTS. Full API available; lacks Android/Chrome apps. | Many require sign-up and cloud service registration; less user-friendly integration (e.g., Amazon, Microsoft). |
| Ease of Use | Easy for beginners and advanced users, with intuitive cloning features. | Slightly more complex due to required platform sign-ups. |
| Pricing and Licensing | Free plan for beginners; paid plans from $5/month to enterprise pricing. Each plan increases character count and features. | Varies by provider; most offer free trials or credits but generally lack ElevenLabs’ voice quality at similar prices. |


Survey results show that ElevenLabs is the clear leader in terms of quality of text to speech.

Graph showing how many times each TTS provider was rated higher than all the others in the survey. In other words, it shows how many times it was ranked number one. 

image.png

Since the nearest competitor is OpenAI, the comparison below looks at ElevenLabs Conversational AI against OpenAI Realtime in more detail.

| Feature | ElevenLabs Conv AI | OpenAI Realtime |
| --- | --- | --- |
| Total Number of Voices | 3k+ | 6 |
| LLMs Supported | Bring your own server or choose from any leading provider | OpenAI models only |
| Call tracking and analytics | Yes, built-in dashboard | No, must build using API |
| Latency | 1-3 seconds depending on network latency and size of knowledge base | Likely faster due to no transcription step |
| Price | 10 cents per minute on business, as low as 2-3 cents per minute on Enterprise with high volume (+LLM cost) | ~15 cents per minute (6 cents per minute input, 24 cents per minute output) |
| Voice Cloning | Yes, bring your own voice with a PVC | No voice cloning |
| API Access | Yes, all plans | Yes, all plans |

The major differentiator is flexibility: ElevenLabs lets users pair any LLM with its text-to-speech model, whereas OpenAI's TTS service can only be used with OpenAI's own LLMs.
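
The ~15 cents per minute quoted for OpenAI Realtime in the price comparison is the simple average of its input and output rates, which implicitly assumes the caller and the agent each speak for half of the call. The sketch below makes that assumption explicit and compares a monthly bill. The function names, the 50/50 talk-time split and the 1,000-minute volume are assumptions for illustration, and the ElevenLabs figure excludes the separate LLM cost.

```python
# Blended per-minute price for a voice agent billed separately for audio in and audio out.
# Rates are the figures quoted in the comparison; the talk-time split is an assumption.


def blended_cents_per_minute(input_cents: float, output_cents: float,
                             output_share: float = 0.5) -> float:
    """Weight input and output rates by the share of call time the agent speaks."""
    if not 0.0 <= output_share <= 1.0:
        raise ValueError("output_share must be between 0 and 1")
    return input_cents * (1.0 - output_share) + output_cents * output_share


def monthly_cost_usd(cents_per_minute: float, minutes: float) -> float:
    """Convert a per-minute rate in cents into a bill in dollars."""
    return cents_per_minute * minutes / 100.0


openai_rate = blended_cents_per_minute(6, 24)  # 6 c/min input, 24 c/min output
elevenlabs_business_rate = 10                  # c/min on the business plan, excluding LLM cost

if __name__ == "__main__":
    minutes = 1_000  # hypothetical monthly call volume
    print(f"OpenAI Realtime: {openai_rate:.1f} c/min -> "
          f"${monthly_cost_usd(openai_rate, minutes):.2f}")
    print(f"ElevenLabs:      {elevenlabs_business_rate:.1f} c/min -> "
          f"${monthly_cost_usd(elevenlabs_business_rate, minutes):.2f} (+ LLM cost)")
```

If the agent speaks for more than half of the call, the blended OpenAI rate rises above 15 cents per minute, since output audio is the more expensive side.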



Appendix

There are three kinds of voices on ElevenLabs:

  • Default voices - Available to all users. They have multilingual capability and different styles for different use cases. For example, Charlie is an Australian, middle-aged male voice for conversational use cases; Alice is a confident, middle-aged British female voice specialized for news.
  • image.png
  • Generated voices - Created using the voice creation tool, where the voice can be customized by age, gender and accent.
  • Cloned voices - Made using Instant Voice Cloning (based on an audio recording of 4 minutes) or Professional Voice Cloning (based on a ~6 hour audio recording).






Focusing on creators, ElevenLabs is expanding the total addressable market by making it much cheaper and faster to dub, modify or create synthetic audio than existing alternatives.

image.png

image.png

image.png

Source: ElevenLabs Pitch Deck https://drive.google.com/file/d/16p8InLz7fl4OV2LXHKbLSGlVP7X34uUm/view


Core Value Proposition

'Make it possible for creators to reach a global audience by speaking in their audience's language at lower cost and higher speed.'

'Human quality automated dubbing as SaaS'

The core value proposition will be experienced by the user once they upload a video and it seamlessly dubs into another language in their voice at the click of a button. A mini aha moment before this could be when the creator is able to effectively clone their voice and experience another language's audio in their own voice.

ElevenLabs offers a freemium model (10,000 credits per month), which equates to about 10 min of free audio. However, to experience the dubbing studio or voice cloning, users have to take an entry-level subscription. The hook for taking the subscription would be the quality of the audio when switching between languages, along with demo videos.
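
The free-tier figures above imply a conversion rate of roughly 1,000 credits per minute of audio. The sketch below applies that ratio to estimate how much audio a credit balance covers. The function names are hypothetical, and the ratio is derived only from the "10,000 credits is about 10 min" statement, so actual consumption may differ by feature or model.

```python
# Convert credits to minutes of audio using the ratio implied in the text:
# 10,000 credits ~ 10 minutes, i.e. roughly 1,000 credits per minute (an approximation).
FREE_CREDITS_PER_MONTH = 10_000
CREDITS_PER_MINUTE = FREE_CREDITS_PER_MONTH / 10  # implied ratio, not an official rate


def credits_to_minutes(credits: float) -> float:
    """Approximate minutes of generated audio that a credit balance covers."""
    return credits / CREDITS_PER_MINUTE


def clips_covered(credits: float, clip_minutes: float) -> int:
    """How many full clips of a given length fit into a credit balance."""
    if clip_minutes <= 0:
        raise ValueError("clip_minutes must be positive")
    return int(credits_to_minutes(credits) // clip_minutes)


if __name__ == "__main__":
    print(credits_to_minutes(FREE_CREDITS_PER_MONTH))  # minutes on the free tier
    print(clips_covered(FREE_CREDITS_PER_MONTH, 3))    # full 3-minute clips per month
```

On these numbers the free tier covers only a handful of short clips per month, which is consistent with it acting as a trial rather than a working plan for regular creators.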



Prioritization framework

  • Product Integration:
    • Cost: Low to Medium. Minimal financial cost due to existing APIs. Possible partnership fees if applicable.
    • Flexibility: High. APIs allow for quick adjustments and updates. Customizable integrations based on needs.
    • Effort: Low to Medium. Low development effort with existing APIs. Effort required for collaboration with partners.
    • Speed: Medium. Faster implementation due to low development effort. Quick deployment possible.
    • Scale: High. Access to large user bases through integration with popular platforms. Scalable across multiple platforms.
    • Budget: Moderate. Budget-friendly with justified costs due to potential reach. Lower than sustained Paid Ad campaigns.
  • Content Loops:
    • Cost: Low. Minimal financial cost. Relies on user-generated content and sharing mechanisms.
    • Flexibility: High. Strategies can be adapted quickly based on engagement metrics and trends. Responsive to user feedback.
    • Effort: High. Continuous effort to create engaging content. Requires monitoring and fostering community engagement.
    • Speed: Medium. Can gain momentum over time. Potential for viral growth if content resonates.
    • Scale: High. Potential for exponential reach through viral content. Amplifies reach via users' networks.
    • Budget: Low. Cost-effective as it leverages creativity and engagement rather than financial investment. Suitable for limited budgets.
  • Organic:
    • Cost: Low. Minimal financial expenditure. Investment in time and resources for content creation and SEO.
    • Flexibility: Medium. Adjustments can be made but may take time to reflect in results. Dependent on search engine indexing.
    • Effort: High. Significant effort in creating high-quality content. Ongoing SEO optimization required.
    • Speed: Slow. Takes time to build authority and see results. Typically several months to a year.
    • Scale: High. Potential to reach a vast audience over time. Content ranks higher and gains more visibility with persistent effort.
    • Budget: Low. Financial expenditure is minimal. Suitable for limited budgets but requires significant time investment.
  • Paid Ads:
    • Cost: High. Requires substantial investment for ad spend. Higher costs in competitive markets.
    • Flexibility: High. Campaigns can be adjusted in real-time based on performance metrics. Flexible targeting options.
    • Effort: Medium. Effort required to set up and manage campaigns. Platforms offer tools to simplify management.
    • Speed: Fast. Immediate visibility and traffic once the campaign is live. Quick way to reach large audiences.
    • Scale: High. Can reach large audiences depending on budget. Scalable with increased ad spend.
    • Budget: High. Significant budget needed to sustain campaigns. May strain budgets especially in competitive niches.
  • Referral Program:
    • Cost: Low to Medium. Costs associated with incentives for referrals. Generally cost-effective as costs are tied to successful conversions.
    • Flexibility: Medium. Programs can be adjusted, but changes may affect user perception. Requires careful management.
    • Effort: Medium. Initial setup requires effort. Ongoing management can be streamlined with automation.
    • Speed: Medium. Takes time for users to start referring others. Momentum builds over time with increased participation.
    • Scale: Medium. Reach is dependent on existing user base and their networks. Potential for exponential growth if widely adopted.
    • Budget: Low to Medium. Budget-friendly. Costs scale with growth and are tied to successful referrals.


  • Product Integration:
    • Effectiveness for Content Creators: Highly Effective. Seamless integration into existing tools enhances user experience. Increases adoption by embedding in creators' workflows. Co-marketing with platforms amplifies reach.
    • Effectiveness for Businesses: Highly Effective. Enhances product offerings by integrating AI voice technology. Fits into existing business workflows. Strengthens partnerships and adds value for customers.
    • Overall Conclusion: Top Priority. With low development effort and high scalability, Product Integration offers significant advantages in reaching both Content Creators and Businesses. Aligns well with expansion goals and provides a competitive edge.
  • Content Loops:
    • Effectiveness for Content Creators: Highly Effective. Encourages creators to generate and share content. Fosters a community around ElevenLabs. Enhances user engagement and retention.
    • Effectiveness for Businesses: Less Effective. Businesses are less likely to engage in content loops. Requires different approaches like case studies or formal content sharing.
    • Overall Conclusion: Second Priority. Highly effective for engaging Content Creators and fostering organic growth. Complements Product Integration by leveraging user engagement. Less impactful for Businesses but overall valuable.
  • Organic:
    • Effectiveness for Content Creators: Highly Effective. Attracts creators searching for tools and tutorials. Builds brand authority and trust. Supports long-term engagement.
    • Effectiveness for Businesses: Moderately Effective. Builds credibility and thought leadership. Supports branding over time. May not yield immediate results for Businesses.
    • Overall Conclusion: Third Priority. Essential for long-term growth and brand building. Highly effective for attracting Content Creators. Supports other channels by providing valuable content and establishing authority.
  • Paid Ads:
    • Effectiveness for Content Creators: Effective. Targets creators on platforms like YouTube and Instagram. Immediate results but requires high budget. Engaging creatives needed.
    • Effectiveness for Businesses: Effective. Reaches Businesses via platforms like LinkedIn. Higher CPC but potentially higher ROI. Requires clear value propositions.
    • Overall Conclusion: Lower Priority. While effective, the high cost and budget requirements make Paid Ads less favorable compared to channels with lower costs and high impact, especially given the low development effort of Product Integration.
  • Referral Program:
    • Effectiveness for Content Creators: Effective. Leverages creators' networks. Encourages sharing among peers and followers. Dependent on existing user base size.
    • Effectiveness for Businesses: Less Effective. Businesses are less influenced by referral programs. Longer sales cycles reduce impact. Trust and reputation are critical.
    • Overall Conclusion: Supportive Channel. Beneficial for engaging Content Creators but less impactful for Businesses. Not prioritized over channels offering broader reach and immediate impact.


The top three acquisition channels for ElevenLabs to focus on are:

  1. Product Integration
  2. Content Loops
  3. Organic

These channels offer the best balance of cost-effectiveness, scalability, alignment with target customer profiles, and potential for rapid and sustainable growth. By leveraging existing APIs for Product Integration, fostering user engagement through Content Loops, and building long-term brand authority via Organic strategies, ElevenLabs can effectively expand its market reach among Content Creators and Businesses.
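
One way to sanity-check this ranking is to turn the qualitative ratings from the prioritization framework (Cost, Flexibility, Effort, Speed, Scale, Budget) into a rough numeric score. The sketch below does this with equal weights. The ratings are copied from the framework, but the numeric mapping and the equal weighting are assumptions made purely for illustration and are not part of the original analysis.

```python
# Turn the qualitative channel ratings into a rough, equal-weight score.
# The ratings come from the prioritization framework; the numeric mapping is an assumption.

# For cost-like criteria a lower rating is better; for benefit-like criteria higher is better.
LOWER_IS_BETTER = {"Low": 3.0, "Low to Medium": 2.5, "Medium": 2.0, "Moderate": 2.0, "High": 1.0}
HIGHER_IS_BETTER = {"Slow": 1.0, "Low": 1.0, "Medium": 2.0, "High": 3.0, "Fast": 3.0}
COST_LIKE = {"Cost", "Effort", "Budget"}

RATINGS = {
    "Product Integration": {"Cost": "Low to Medium", "Flexibility": "High", "Effort": "Low to Medium",
                            "Speed": "Medium", "Scale": "High", "Budget": "Moderate"},
    "Content Loops": {"Cost": "Low", "Flexibility": "High", "Effort": "High",
                      "Speed": "Medium", "Scale": "High", "Budget": "Low"},
    "Organic": {"Cost": "Low", "Flexibility": "Medium", "Effort": "High",
                "Speed": "Slow", "Scale": "High", "Budget": "Low"},
    "Paid Ads": {"Cost": "High", "Flexibility": "High", "Effort": "Medium",
                 "Speed": "Fast", "Scale": "High", "Budget": "High"},
    "Referral Program": {"Cost": "Low to Medium", "Flexibility": "Medium", "Effort": "Medium",
                         "Speed": "Medium", "Scale": "Medium", "Budget": "Low to Medium"},
}


def score(channel_ratings):
    """Sum the mapped value of every criterion for one channel."""
    total = 0.0
    for criterion, rating in channel_ratings.items():
        table = LOWER_IS_BETTER if criterion in COST_LIKE else HIGHER_IS_BETTER
        total += table[rating]
    return total


SCORES = {channel: score(ratings) for channel, ratings in RATINGS.items()}

if __name__ == "__main__":
    for channel, value in sorted(SCORES.items(), key=lambda kv: -kv[1]):
        print(f"{channel:20s} {value:.1f}")
```

With this equal-weight mapping, Product Integration and Content Loops come out on top and the remaining three channels tie. The placement of Organic above Paid Ads and Referral Program therefore rests on the per-segment effectiveness assessment rather than on these six ratings alone.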

Highly optimized homepage for SEO

ElevenLabs’ homepage generates the vast majority of their traffic volume, which is worth over $134,935. That doesn’t change by much when you look at the traffic value, either. That’s an unusual profile for a site of its size. Their organic traffic comes from search terms such as “elevenlabs”, “eleven labs”, and “11 labs” with a combined 91,000 searches monthly, indicating that people have already heard about ElevenLabs and are looking for them specifically — not generic terms like “AI dubbing”. But it isn’t all referrals from other sites or people searching for their brand name that have led to people knowing about ElevenLabs. Their homepage ranks in the top three for high-volume search terms like “ai voice generator” (104,000 searches monthly), “voice ai” (52,000 searches monthly), and “ai voice” (41,000 searches monthly). ElevenLabs built a homepage that topped the SERPs for every search related to their name — that allows them to rely on their homepage as the main source of organic traffic. (Source: https://foundationinc.co/lab/elevenlabs-journey)



However, there are certain keywords for which ElevenLabs does not appear on the first page of Google results. Content targeting these keywords needs to be improved:

  • Text to realistic voice
  • AI voice bot


For ElevenLabs, content loops can amplify the reach of its AI voice technology by encouraging users to create and share content that showcases the product's capabilities. This promotes brand awareness and creates a self-sustaining cycle of user-generated content that attracts new users. The incentive for users to share content and credit ElevenLabs, which grows top-of-funnel (TOFU) awareness, is a higher credit limit on their current plan (credits are the platform's currency). More available credits in turn lead to higher usage of the platform, which feeds the loop.

A few ideas for content loops

Idea 1: AI-Dubbed Video Sharing with Free Credits Incentive

Hook: Transform your videos with realistic AI dubbing in multiple languages and earn free credits for sharing and crediting ElevenLabs.

Content Creator: YouTubers, social media influencers, video content creators.

Distribution Channel: YouTube, Instagram, TikTok, Facebook, content creator networks.

Incentive Mechanism:

  • Free Credits for Sharing: Users receive additional free credits when they share their AI-dubbed videos on social media and credit ElevenLabs.
  • Reward Criteria:
    • Include a specific hashtag (e.g., #DubbedWithElevenLabs).
    • Tag ElevenLabs' official account.
    • Provide a link to ElevenLabs in the video description or post.
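
The reward criteria above could be checked automatically before credits are granted. The following is a minimal sketch under stated assumptions: the hashtag is the example from the criteria, while the account handle, link domain, credit amount, and function names are hypothetical placeholders rather than ElevenLabs' actual program rules.

```python
import re

# The hashtag is the example from the criteria above. The account handle,
# link domain, and credit amount are hypothetical placeholders.
REQUIRED_HASHTAG = "#DubbedWithElevenLabs"
OFFICIAL_HANDLE = "@elevenlabsio"
LINK_DOMAIN = "elevenlabs.io"
CREDITS_PER_QUALIFYING_POST = 1_000


def _mentions(token: str, text: str) -> bool:
    """True if token appears as a whole hashtag or handle, ignoring case."""
    pattern = r"(?<!\w)" + re.escape(token) + r"(?!\w)"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None


def missing_criteria(caption: str) -> list:
    """Return the reward criteria that a post caption does not yet satisfy."""
    missing = []
    if not _mentions(REQUIRED_HASHTAG, caption):
        missing.append("hashtag")
    if not _mentions(OFFICIAL_HANDLE, caption):
        missing.append("account tag")
    if LINK_DOMAIN not in caption.lower():
        missing.append("link")
    return missing


def credits_earned(caption: str) -> int:
    """Grant credits only when all three criteria are met."""
    return 0 if missing_criteria(caption) else CREDITS_PER_QUALIFYING_POST
```

Returning the list of missing criteria, rather than a plain yes/no, would let the product tell a creator exactly what to add to the post to qualify.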

How It Works

  1. Content Creation:
    • Creators use ElevenLabs to dub their videos into multiple languages using AI voices.
    • They enhance their content's accessibility and global reach.
  2. Sharing and Earning Credits:
    • Creators share the dubbed videos on their channels.
    • By crediting ElevenLabs as per the criteria, they earn free credits added to their accounts.
    • These credits can be used for future dubbing projects, encouraging continued use.
  3. Audience Engagement:
    • Viewers watch the high-quality dubbed videos and engage through likes, comments, and shares.
    • The inclusion of ElevenLabs branding raises awareness among viewers.
  4. Viral Spread and Adoption:
    • Other creators notice the quality and the incentive program.
    • Attracted by the opportunity to enhance their content and earn free credits, new users sign up.
  5. Positive Feedback Loop:
    • Increased usage leads to more sharing, earning more credits, and further usage.
    • The cycle reinforces itself, driving platform growth.


Idea 2: Voice Clone Challenges with Free Credits Incentive

Hook: Participate in the Voice Clone Challenge, share your AI-generated speech, and earn free credits on ElevenLabs.

Content Creator: AI enthusiasts, social media users, tech-savvy individuals.

Distribution Channel: TikTok, Instagram Reels, Twitter, Facebook, online forums.

Incentive Mechanism:

  • Free Credits for Participation: Users receive free credits when they share their voice clone content and credit ElevenLabs.
  • Bonus Credits:
    • Additional credits for posts that achieve high engagement (e.g., a certain number of likes or shares).
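
One way to make the bonus rule concrete is a tiered table keyed on engagement. This is only a sketch: the base amount, the thresholds, the bonus sizes, and the choice of likes plus shares as the engagement measure are all assumptions for illustration.

```python
# Hypothetical reward schedule: (minimum engagement, bonus credits),
# listed from the highest tier down so the first match wins.
BASE_CREDITS = 500
BONUS_TIERS = [
    (10_000, 2_000),
    (1_000, 500),
    (100, 100),
]


def credits_for_post(likes: int, shares: int) -> int:
    """Base credits for participating plus the bonus for the highest tier reached."""
    engagement = likes + shares
    bonus = next(
        (credits for threshold, credits in BONUS_TIERS if engagement >= threshold),
        0,  # below the lowest tier: participation credits only
    )
    return BASE_CREDITS + bonus
```

Capping the bonus at the top tier keeps the cost of a single viral post predictable.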

How It Works

  1. Voice Cloning and Content Creation:
    • Users create a voice clone using ElevenLabs.
    • They generate an audio recording of themselves reading a famous speech, quote, or a creative script.
  2. Sharing and Incentivization:
    • Participants share their recordings on social media platforms.
    • They must use a designated hashtag (e.g., #MyVoiceElevenLabs) and tag ElevenLabs to qualify for free credits.
    • Upon sharing, free credits are added to their accounts.
  3. Community Engagement:
    • The challenge encourages friendly competition and community interaction.
    • Users engage with each other's posts, fostering a sense of community.
  4. Amplification Through Rewards:
    • Posts with higher engagement receive additional credits, motivating users to create compelling content.
    • The prospect of earning more credits incentivizes users to promote their posts.
  5. Attracting New Users:
    • The visibility of the challenge draws in new users interested in voice cloning and earning rewards.
    • New users join the platform and participate, further fueling the loop.


Idea 3: Educational Content Creation with Free Credits Incentive

Hook: Enhance your educational content with AI voices and earn free credits by sharing and crediting ElevenLabs.

Content Creator: Educators, online instructors, educational content creators.

Distribution Channel: E-learning platforms, YouTube educational channels, social media.

Incentive Mechanism:

  • Free Credits for Content Sharing: Educators receive free credits when they share their AI-narrated educational content and credit ElevenLabs.
  • Referral Bonuses:
    • Additional credits for referring other educators who start using ElevenLabs.

How It Works

  1. Content Creation:
    • Educators use ElevenLabs to create engaging lessons, tutorials, or courses.
  2. Sharing and Crediting:
    • They publish their content on educational platforms or social media.
    • By crediting ElevenLabs (e.g., mentioning in the video, description, or using a hashtag like #TeachingWithElevenLabs), they earn free credits.
  3. Learner Engagement:
    • High-quality, AI-narrated content enhances learner experience.
    • Students share and recommend the content, increasing its reach.
  4. Referral Incentive:
    • Educators are encouraged to refer peers.
    • For each referral that results in a new user, they earn additional free credits.
  5. Community Growth:
    • A network of educators using ElevenLabs forms, sharing resources and best practices.
    • Continuous sharing leads to more free credits and usage.
  6. Positive Loop Effect:
    • The incentives motivate ongoing content creation and sharing.
    • The loop strengthens as more educators join and contribute.


Idea 4: AI Voice Meme Creation with Free Credits Incentive

Hook: Create viral memes using AI-generated voices and earn free credits when you share and credit ElevenLabs.

Content Creator: Meme creators, social media enthusiasts, influencers.

Distribution Channel: Reddit, Twitter, TikTok, Instagram, meme communities.

Incentive Mechanism:

  • Free Credits for Sharing Memes: Users earn free credits for each meme shared that credits ElevenLabs.
  • Viral Bonus:
    • Additional credits for memes that achieve viral status (e.g., reach a certain number of shares or impressions).

How It Works

  1. Meme Creation:
    • Users create memes using AI-generated voices from ElevenLabs.
    • They combine audio with visuals to produce engaging content.
  2. Sharing and Crediting:
    • Memes are shared on social media platforms.
    • By including a hashtag like #MemesWithElevenLabs and tagging ElevenLabs, users earn free credits.
  3. Community Engagement:
    • Memes attract attention and are shared widely.
    • High-engagement memes can earn users additional free credits.
  4. Trend Amplification:
    • As the trend grows, more users are inspired to create their own AI voice memes.
    • The prospect of earning free credits encourages participation.
  5. Expanding User Base:
    • The visibility of the memes introduces ElevenLabs to a broader audience.
    • New users sign up to create content and earn rewards.
  6. Sustained Engagement:
    • Regular creation and sharing of memes keep the loop active.
    • Users remain engaged with the platform to continue earning credits.


Strengthening the Positive Feedback Loop with Free Credits


Motivation to Engage:

  • Immediate Reward: Free credits provide an instant benefit for users' actions, reinforcing positive behavior.
  • Increased Usage: Free credits lower the barrier to experimenting with ElevenLabs' features, leading to more content creation.
  • Value Perception: Users perceive higher value in the platform when they receive tangible rewards.


Cycle Enhancement:

  1. Incentive Leads to Sharing: Users are more likely to share content to earn free credits.
  2. Sharing Increases Visibility: Content reaches a wider audience, raising awareness of ElevenLabs.
  3. New Users Attracted: Others see the content and incentives, prompting them to join.
  4. More Users Create Content: The user base grows, and more content is generated.
  5. Loop Intensifies: The cycle repeats with amplified effects due to the growing community.
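
The cycle above can be roughed out as a compounding model to sanity-check whether a loop is worth funding. This is a simplified sketch: every input is a hypothetical planning parameter, not ElevenLabs data, and it assumes all existing users share at the same rate in every cycle.

```python
def simulate_content_loop(initial_users, share_rate, views_per_share,
                          signup_rate, cycles):
    """Return the total user count after each cycle of the loop.

    share_rate:      fraction of users who share credited content per cycle
    views_per_share: average viewers reached by one shared piece of content
    signup_rate:     fraction of those viewers who become new users
    """
    # New users generated per existing user in one cycle.
    new_users_per_user = share_rate * views_per_share * signup_rate
    totals = []
    users = float(initial_users)
    for _ in range(cycles):
        users += users * new_users_per_user  # share -> reach -> sign up
        totals.append(users)
    return totals


if __name__ == "__main__":
    for cycle, total in enumerate(simulate_content_loop(1000, 0.2, 100, 0.01, 3), 1):
        print(f"cycle {cycle}: {total:.0f} users")
```

In this model the loop only compounds meaningfully when the product of the three rates is large enough to justify the credits given away, so those rates are the numbers to measure first in a pilot.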





Identification of complementary products

Content creators

| Category | Tools |
| --- | --- |
| Video Editing Software | Adobe Premiere Pro, Final Cut Pro, DaVinci Resolve, iMovie |
| Audio Editing Software | Adobe Audition, Audacity, Logic Pro X |
| Streaming & Recording Software | OBS Studio, Streamlabs OBS, XSplit |
| Content Platforms | YouTube, Instagram, TikTok, Twitch |
| Mobile Content Creation Apps | InShot, KineMaster, CapCut |
| Design Tools | Canva, Adobe Photoshop, Adobe After Effects |

Businesses

| Category | Tools |
| --- | --- |
| Customer Support Platforms | Zendesk, Intercom, Freshdesk |
| CRM Systems | Salesforce, HubSpot, Zoho CRM |
| E-Learning Platforms | Moodle, Canvas, Blackboard |
| Collaboration Tools | Microsoft Teams, Slack, Zoom |
| Marketing Automation Tools | Mailchimp, Marketo, HubSpot Marketing Hub |


Prioritization of product integrations

We will use the following framework to shortlist which partner integrations to pursue:

  • Time to Go Live: Estimated time required to develop and launch the integration.
  • Tech Effort: Level of technical resources and complexity involved.
  • New Users We Can Get (Monthly): Estimated number of new users the integration can bring in per month.

An additional consideration in selecting the right integration partner is whether the integration adds value to the user flow in the product. That will be the final filter we apply when shortlisting partner integrations.

Content Creator Tools

| Integration Partner | Time to Go Live | Tech Effort | New Users (Monthly) |
| --- | --- | --- | --- |
| InShot | Low (1-2 months) | Low | High (10,000+) |
| OBS Studio | Low (1-2 months) | Low to Medium | High (8,000+) |
| CapCut | Medium (2-3 months) | Medium | High (10,000+) |
| Audacity | Low (1-2 months) | Low | Medium (5,000+) |
| Adobe Premiere Pro | High (6-9 months) | High | High (10,000+) |
| DaVinci Resolve | Medium (3-4 months) | Medium | Medium (7,000+) |
| Final Cut Pro | High (6-9 months) | High | Medium (6,000+) |
| iMovie | Medium (3-4 months) | Medium | Medium (5,000+) |

Business Tools

| Integration Partner | Time to Go Live | Tech Effort | New Users (Monthly) |
| --- | --- | --- | --- |
| Slack | Medium (2-3 months) | Medium | Medium (4,000+) |
| Zendesk | Medium (3-4 months) | Medium | Medium (3,000+) |
| Moodle | Low (1-2 months) | Low | Medium (2,000+) |
| Salesforce | High (6-9 months) | High | High (5,000+) |
| Microsoft Teams | High (6-9 months) | High | High (5,000+) |


Based on the above framework, we can shortlist three integrations:

  • InShot
    • Time to Go Live: Low
    • Tech Effort: Low
    • New Users: High
  • OBS Studio
    • Time to Go Live: Low
    • Tech Effort: Low to Medium
    • New Users: High
  • CapCut
    • Time to Go Live: Medium
    • Tech Effort: Medium
    • New Users: High


Integrations with the likes of YouTube, Instagram, and Adobe would be a huge value add for ElevenLabs. However, partnerships with such large enterprises are very hard to secure and would have the longest lead times, so those integrations have been deprioritized.


Customer journey and integration flow

InShot

Customer Journey Map:

  1. Discovery: User updates or downloads InShot, now featuring ElevenLabs integration.
  2. Content Creation: User opens InShot to create or edit a video.
  3. Encounter with ElevenLabs: Within the audio options, the user sees "Add AI Voiceover by ElevenLabs."
  4. Value Addition: Users can generate high-quality voiceovers directly within the app, enhancing their videos without leaving the platform.
  5. Engagement: The seamless experience encourages users to use the feature regularly.


Integration Flow:

  1. Access ElevenLabs Feature: User selects "Music" or "Audio" within InShot.
  2. Select "AI Voiceover by ElevenLabs": A new option appears under audio sources.
  3. Input Text: User types or pastes the script for the voiceover.
  4. Choose Voice Settings: Selects language, voice type, tone, and style.
  5. Preview Audio: User listens to a preview and makes adjustments if necessary.
  6. Insert Voiceover: The AI-generated voiceover is added to the video timeline.
  7. Finalize and Share: User completes editing and exports the video.
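
Steps 3 to 6 of this flow map onto a single text-to-speech call from the partner app's backend. The sketch below only builds the HTTP request. The endpoint path, the `xi-api-key` header, and the body fields reflect ElevenLabs' public text-to-speech API as commonly documented and should be verified against the current API reference; the function names and default values are illustrative assumptions.

```python
import json
import urllib.request

API_BASE = "https://api.elevenlabs.io/v1"


def build_voiceover_request(api_key, voice_id, script, *,
                            model_id="eleven_multilingual_v2",
                            stability=0.5, similarity_boost=0.75):
    """Build (but do not send) a text-to-speech request for one script."""
    body = {
        "text": script,            # step 3: the script the user typed
        "model_id": model_id,      # step 4: model choice (assumed default)
        "voice_settings": {        # step 4: voice tuning (assumed defaults)
            "stability": stability,
            "similarity_boost": similarity_boost,
        },
    }
    return urllib.request.Request(
        url=f"{API_BASE}/text-to-speech/{voice_id}",
        data=json.dumps(body).encode("utf-8"),
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def fetch_voiceover(request) -> bytes:
    """Send the request and return the audio bytes for preview or insertion."""
    with urllib.request.urlopen(request) as response:
        return response.read()
```

Separating request construction from the network call lets the partner app regenerate the preview (step 5) with adjusted settings and only place the final audio on the timeline (step 6) once the user is satisfied.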


OBS Studio

Customer Journey Map:

  1. Plugin Installation: User downloads the ElevenLabs Plugin from the OBS plugin repository.
  2. Setting Up: User configures the plugin within OBS for live streaming or recording.
  3. Value Addition: Users can add AI-generated voiceovers or announcements in real-time during streams.
  4. Engagement: Enhanced streams attract more viewers, benefiting the user and ElevenLabs.


Integration Flow:

  1. Install Plugin: User downloads and installs the ElevenLabs plugin.
  2. Configure Settings: Accesses plugin settings to authenticate and select preferences.
  3. Add ElevenLabs Source: Adds a new audio source in OBS called "ElevenLabs AI Voice."
  4. Input or Trigger Text: Sets up text inputs or triggers for live voice generation.
  5. Live Use: During streaming, the AI voice can be activated as needed.
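
Step 4 of this flow (text inputs or triggers) could be modeled as a mapping from stream events to announcement templates, with the resulting text passed on for voice generation. The event names, templates, and function name here are hypothetical; this sketch does not use any real OBS or ElevenLabs plugin API.

```python
from typing import Optional

# Hypothetical stream events mapped to the text the AI voice should speak.
TRIGGER_TEMPLATES = {
    "new_follower": "Thanks for the follow, {user}!",
    "donation": "{user} just donated {amount}. Thank you!",
}


def announcement_for(event_type: str, **fields: str) -> Optional[str]:
    """Return the text to speak for a stream event, or None if unconfigured."""
    template = TRIGGER_TEMPLATES.get(event_type)
    if template is None:
        return None  # no trigger set up for this event, so stay silent
    return template.format(**fields)
```

Keeping the templates in a plain mapping means a streamer could edit them in the plugin settings without touching the code that generates the audio.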


CapCut

Customer Journey Map:

  1. Discovery: User opens CapCut, which now includes ElevenLabs integration.
  2. Content Creation: User edits a video and accesses "AI Voiceover by ElevenLabs."
  3. Value Addition: Ability to add professional voiceovers enhances the quality of short-form videos.
  4. Engagement: Users are more likely to share high-quality content, increasing visibility for both CapCut and ElevenLabs.


Integration Flow:

  1. Access ElevenLabs Feature: Within CapCut's audio options, user selects "AI Voiceover by ElevenLabs."
  2. Input Text and Settings: Similar to InShot, user inputs text and selects voice parameters.
  3. Add to Timeline: The voiceover is inserted into the video.
  4. Edit and Share: User finalizes the video and shares it directly to platforms like TikTok.



